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Albanian language 


Albanian (/ael' belnian/; shqip, [ f Cq ip] or gjuha shqipe, [f juha ' fcqipE] ) is an Indo- 
European language spoken by the Albanians in the Balkans and the Albanian diaspora in the 
Americas, Europ e and Oceania / 1 !! 9 ] With about 7.5 million speakers/ 2 ^ 1 ! it comprises an 
independent branch within the Indo-European languages and is not closely related to any 
other language/ 10 ! 

First attested in the 15th century, it is the last Indo-European branch to appear in written 
records. This is one of the reasons why its still-unknown origin has long been a matter of 
dispute among linguists and historians/ 10 ! Albanian is considered to be the descendant of one 
of the Paleo-Balkan languages of antiquity. For more historical and geographical reasons than 
specifically linguistic ones, there are various modern historians and linguists who believe that 
the Albanian language may have descended from a southern Illyrian dialect! 11 ! spoken in 
much the s ame region in classical times. Alternative hypotheses hold that Albanian may have 
descended from Thracian or Daco-Moesian, other ancient languages spoken farther east than 
Illyrian/ 10 !! 12 ! Not enough is known of these languages to completely prove or disprove the 
various hypotheses/ 13 ! 

The two main Albanian dialects, Gheg and Tosk, are primarily distinguished by phonological 
differences and are mutually intelligible/ 14 ^ 15 ^ with Gheg spoken to the north and Tosk 
spoken to the south of the Shkumbin river/ 14 ] Their characteristics! 16 ! in the treatment of 
both native words and loanwords indicate the dialectal split occurred after Christianisation of 
the region (4th century AD) and at the time of the Slavic migration to the Balkans/ 17 ^ 10 ] 
with the historic boundary between Gheg and Tosk being the Shkumbin] 19 ] which straddled 
the Jirecek line/ 20 ]] 21 ] Standard Albanian is a standardised form of spoken Albanian based 
on the Tosk dialect. It is the official language of Albania and Kosovo] 2 ] and a co-official 
language in North Macedonia as well as a minority language of Italy, Montenegro , Romania 
and Serbia. 

Centuries-old communities speaking Albanian dialects can be found scattered in Croatia (the 
Arbanasi), Greece (the Arvanites and some communities in Epirus, Western Macedonia and 
Western Thrace)/ 22 ] Italy (the Arbereshe)! 23 ! as well as in Romania, Turkey and Ukraine/ 24 ] 
Two varieties of the Tosk dialect, Arvanitika in Greece and Arberesh in Southern Italy, have 
preserved archaic elements of the language/ 14 ] 
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The dialects of the Albanian language. 
(The map does not indicate where the language is 
majority or minority.) 


The language is spoken by approximately 6 million people in the Balkans, primarily in 
Albania, Kosovo, North Macedonia, Serbia, Montenegro and Greece.^ However, due to old 
communities in Italy and the large Albanian diaspora, the worldwide total of speakers is much higher 
than in Southern Europe and numbers approximately 7.5 million/ 2 ^ 1 ! 


Europe 

The Albanian language is the official language of Albania and Kosovo and co-official in North 
Macedonia. Albanian is a recognised minority language in Croatia, Italy , Montenegro , Romania and in 
Serbia. Albanian is also spoken by a minority in Greece, specifically in the Thesprotia and Preveza 
regional units and in a few villages in Ioannina and Fiorina regional units in Greece.^ It is also spoken 
by 450,000 Albanian immigrants in Greece. 

Albanian is the third most spoken language in ItalyJ 25 ^ This is due to a substantial Albanian 
immigration to Italy. Italy has a historical Albanian minority of about 500,000, scattered across Southern 
Italy, known as Arbereshe. Approximately 1 million Albanians from Kosovo are dispersed throughout 
Germany, Switzerland and Austria. These are mainly immigrants from Kosovo who migrated during the 
1990s. In Switzerland, the Albanian language is the sixth most spoken language with 176,293 native 
speakers. 



Albanian became an official language in North Macedonia on 15 January 2019.t 26 ^ 


The dialects of Albania 


Americas 

There are large numbers of Albanian speakers in the United States, Argentina , Chile, Uruguay and Canada. Some of the first ethnic 
Albanians to arrive in the United States were Arbereshe. Arbereshe have a strong sense of identity and are unique in that they speak an 
archaic dialect of Tosk Albanian called Arbereshe. 

In North America (United States and Canada), there are approximately 250,000 Albanian speakers. It is spoken in the eastern area of the 
United States in cities like New York City, New Jersey, Boston, Chicago, Philadelphia and Detroit, as well as in parts of the states of Ohio 
and Connecticut. Greater New Orleans has a large Arberesh community. Oftentimes, wherever there are Italians, there are a few Arbereshe 
mixed with them. Arbereshe Americans, therefore are often indistinguishable from Italian Americans due to being assimilated into the Italian 
American community. 





















In Argentina, there are nearly 40,000 Albanian speakers, mostly in Buenos Aires.^ 


Asia and Oceania 

Approximately 1.3 million people of Albanian ancestry live in Turkey, and more than 500,000 recognizing their ancestry, language and 
culture. There are other estimates, however, that place the number of people in Turkey with Albanian ancestry and or background upward to 
5 million. However, the vast majority of this population is assimilated and no longer possesses fluency in the Albanian language, though a 
vibrant Albanian community maintains its distinct identity in Istanbul to this day. 

In Egypt there are around 18,000 Albanians, mostly Tosk speakers Many are descendants of the Janissary of Muhammad Ali Pasha, an 
Albanian who became Wali, and self-declared Khedive of Egypt and Sudan. In addition to the dynasty that he established, a large part of the 
former Egyptian and Sudanese aristocracy was of Albanian origin. In addition to the recent emigrants, there are older diasporic communities 
around the world. 

Albanian is also spoken by Albanian diaspora communities residing in Australia and New Zealand. 

Dialects 


The Albanian language has two distinct dialects, Tosk which is spoken in the south, and Gheg spoken in 
the north.^ Standard Albanian is based on the Tosk dialect. The Shkumbin River is the rough dividing 
line between the two dialectsJ 30 ^ 


Ghe g is divided into four sub-dialects, in Northwest Gheg, Northeast Gheg, Central Gheg, and Southern 
Gheg. It is primarily spoken in northern Albania and throughout Montenegro , Kosovo and northwestern 
North Macedonia. One fairly divergent dialect is the Upper Reka dialect, which is however classified as 
Central Gheg. There is also a diaspora dialect in Croatia, the Arbanasi dialect. 

Tosk is divided into five sub-dialects, including Northern Tosk (the most numerous in speakers), Laberisht, Gam , Arvanitika, and Arberesh. 
Tosk is spoken in southern Albania, southwestern North Macedonia and northern and southern Greece. Cham Albanian is spoken in North¬ 
western Greece, while Arvanitika is spoken by the Arvanites in southern Greece. In addition, Arberesh is spoken by the Arbereshe people, 
descendants of 15th and 16th century migrants who settled in southeastern Italy, in small communities in the regions of Sicily and 
Calabria, 

Orthography 



The dialects of the Albanian 
language. 


The Albanian language has been written using many different alphabets since the earliest records from 
the 14th century. The history of Albanian language orthography is closely related to the cultural 
orientation and knowledge of certain foreign languages among Albanian writersJ 32 ^ The earliest written 
Albanian records come from the Gheg area in makeshift spellings based on Italian or Greek. Originally, 
the Tosk dialect was written in the Greek alphabet and the Gheg dialect was written in the Latin script. 

Both dialects had also been written in the Ottoman Turkish version of the Arabic script, Cyrillic , and 
some local alphabets (Elbasan, Vithkuq i, Todhri, Veso Bey, Jan Vellara and others, see original Albanian 
alphabets). More specifically, the writers from northern Albania and under the influence of the Catholic Church used Latin letters, those in 
southern Albania and under the influence of the Greek Orthodox church used Greek letters, while others throughout Albania and under the 
influence of Islam used Arabic letters. There were initial attempts to create an original Albanian alphabet during the 1750-1850 period. 
These attempts intensified after the League of Prizren and culminated with the Congress of Manastir held by Albanian intellectuals from 14 
to 22 November 1908, in Manastir (present day Bitola), which decided on which alphabet to use, and what the standardized spelling would 
be for standard Albanian. This is how the literary language remains. The alphabet is the Latin alphabet with the addition of the letters <e>, 
<g>, and ten digraphs : dh, th, xh, gj, nj, ng, //, rr, zh and sh. 

According to Robert Elsie:^ 



Albanian keyboard layout. 


The hundred years between 1750 and 1850 were an age of astounding orthographic diversity in Albania. In this period, the 
Albanian language was put to writing in at least ten different alphabets - most certainly a record for European languages. ... the 
diverse forms in which this old Balkan language was recorded, from the earliest documents to the beginning of the twentieth 
century ... consist of adaptations of the Latin, Greek, Arabic, and Cyrillic alphabets and (what is even more interesting) a 
number of locally invented writing systems. Most of the latter alphabets have now been forgotten and are unknown, even to the 
Albanians themselves.^ 


Classification 


The Albanian language occupies an independent branch of the Indo-European language treej 34 ^ In 1854, Albanian was demonstrated to be 
an Indo-Europ e an languag e by the philolo gist Franz Bopp . Albanian was formerly compared by a few Indo-European linguists with 
Germanic and Balto-Slavic, all of which share a number of isoglosses with Albanian.^ 35 ] Other linguists linked the Albanian language with 














































Latin, Greek and Arme nian, while placing Germanic and Balto-Slavic in another branch of Indo- 
European.' 36][37][38] 

History 


The first written mention of the Albanian language was on 14 July 1284 in Dubrovnik in modern 
Croatia when a crime witness named Matthew testified: "I heard a voice shouting on the mountainside 
in the Albanian language" ( Latin : Audivi unam vocem, clamantem in monte in lingua a/banesca).' 39 " 40 ' 

The oldest document written in Albanian dates back to 1462,' 10 ' while the first audio recording in the 
language was made by Norbert Jokl on 4 April 1914 in Vienna.' 41 ' 

During the five-century period of the Ottoman presence in Albania , the language was not officially 
recognized until 1909, when the Congress of Dibra decided that Albanian schools would finally be 
allowed.' 42 ' 

Linguistic affinities 

Albanian is considered an isolate within the Indo-European language family; no other language has been conclusively linked to its branch. 
The only other languages that are sole surviving members of a branch of Indo-European are Armenian and Greek. 

The Albanian language is part of the Indo-European lan guage group and is considered to have evolved from one of the Paleo-Balkan 
languages of antiquity,' 43 " 44 " 45 ^ although it is still uncertain which particular Paleo-Balkan language represents the ancestor of Albanian, or 
where in Southern Europe that population lived.' 46 ' In general there is insufficient evidence to connect Albanian with one of those 
languages, whether one of the Illyrian languages (which historians mostly confirm), or Thracian and Dacian.' 47 ' Among these possibilities, 
Illyrian is typically held to be the most probable, though insufficient evidence still clouds the discussion.' 48 ' 

Although Albanian shares lexical isoglosses with Greek, Germanic, and to a lesser extent Balto-Slavic, the vocabulary of Albanian is quite 
distinct. In 1995, Taylor, Ringe and Warnow, using quantitative linguistic techniques, found that Albanian appears to comprise a "subgroup 
with Germanic". However, they argued that this fact is hardly significant, as Albanian has lost much of its original vocabulary and 
morphology, and so this "apparently close connection to Germanic rests on only a couple of lexical cognates - hardly any evidence at 
ah".' 49 ' 



Historical presence and location 

The place and the time where the Albanian language was formed 
is uncertain.' 50 ' American linguist Eric Hamp stated that during 
an unknown chronological period a pre-Albanian population 
(termed as "Albanoid" by Hamp) inhabited areas stretching from 
Poland to the southwestern Balkans.' 51 ' Further analysis has 
suggested that it was in a mountainous region rather than on a 
plain or seacoast:' 52 ' while the words for plants and animals 
characteristic of mountainous regions are entirely original, the 
names for fish and for agricultural activities (such as p loughi ng) 
are borrowed from other languages.' 53 ' 

A deeper analysis of the vocabulary, however, shows that this could be a consequence of a prolonged 
Latin domination of the coastal and plain areas of the country, rather than evidence of the original environment where the Albanian language 
was formed. For example, the word for 'fish' is borrowed from Latin, but not the word for 'gills', which is native. Indigenous are also the 
words for 'ship', 'raft', 'navigation', 'sea shelves' and a few names of fish kinds, but not the words for 'sail', 'row' and 'harbor' - objects 
pertaining to navigation itself and a large part of sea fauna. This rather shows that Proto-Albanians were pushed away from coastal areas in 
early times (probably after the Latin conquest of the region) thus losing large parts (or the majority) of sea environment lexicon. A similar 
phenomenon could be observed with agricultural terms. While the words for 'arable land', 'corn', 'wheat', 'cereals', 'vineyard', 'yoke', 
'harvesting', 'cattle breeding', etc. are native, the words for 'ploughing', 'farm' and 'farmer', agricultural practices, and some harvesting tools 
are foreign. This, again, points to intense contact with other languages and people, rather than providing evidence of a possible Urheimat. 

The centre of Albanian settlement remained the Mat river. In 1079, they were recorded farther south in the valley of the Shkumbin riverJ 54 ^ 
The Shkumbin, a seasonal stream that lies near the old Via Egnatia , is approximately the boundary of the primary dialect division for 
Albanian, Tosk and Gheg. The characteristics of Tosk and Gheg in the treatment of the native and loanwords from other languages are 
evidence that the dialectal split preceded the Slavic migration to the Balkans, 1 55 ][ 56 1[ 57 1 which means that in that period (the 5th to 6th 
centuries AD), Albanians were occupying nearly the same area around the Shkumbin river, which straddled the Jirecek Line J 58 ^ 52 ^ 

References to the existence of Albanian as a distinct language survive from the 14th century, but they failed to cite specific words. The oldest 
surviving documents written in Albanian are the "formula e pagezimit" (Baptismal formula), Un'te paghesont' pr'emenit t'Atit e t'Birit e 
t'Spertit Senit. ("I baptize thee in the name of the Father, and the Son, and the Holy Spirit") recorded by Pal Engjelli, Bishop of Durres in 
1462 in the Gheg dialect, and some New Testament verses from that period. 



The location of the Albanoi tribe 
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Linguists Stefan Schumacher and Joachim Matzinger (University of Vienna) assert that the first 
literary records of Albanian date from the 16th century, t 59 ^ 60 ^ The oldest known Albanian printed 
book, Meshari, or "missal", was written in 1555 by Gjon Buzuku, a Roman Catholic cleric. In 1635 
Frang Bardhi wrote the first Latin-Albanian dictionary. The first Albanian school is believed to have 
been opened by Franciscans in 1638 in Pdhane. 

One of the earliest dictionaries of Albanian language was written in 1693 which was an Italian 
language manuscript authored by Montenegrin sea captain Julije Balovic Pratichae Schrivaneschae 
and includes a multilingual dictionary of hundreds of the most often used words in everyday life in the 
Italian, Slavo-Illirico, Greek, Albanian and Turkish languages.^ 


Pre-Indo-European substratum 

Pre-Indo-European (PIE) sites are found throughout the territory of Albania. Such PIE sites existed in 
Maliq, Vashtem, Burimas, Barq, Dersnik in Korqe District, Kamnik in Kolonja, Kolsh in Kukes 
District, Rashtan in Librazhd and Nezir in Mat District.As in other parts of Europe, these PIE 
people joined the migratory Indo-European tribes that entered the Balkans and contributed to the 
formation of the historical Paleo-Balkan tribes. In terms of linguistics, the pre-Indo-European substrate 
language spoken in the southern Balkans has probably influenced pre-Proto-Albanian, the ancestor 
idiom of Albanian The extent of this linguistic impact cannot be determined with precision due to 
the uncertain position of Albanian among Paleo-Balkan lang uages and their scarce attestation 
Some loanwords, however, have been proposed such as shege (" pomegranate ") or lepjete ("orach", compare with Pre-Greek lapathon, 
XanaOov, "monk's rhubarb")^ 64 ^ 62 ^ 



1905 issue of the magazine 
Albania, the most important 
Albanian periodical of the early 
20th century 


Proto-IE features 

Although Albanian has several words that do not correspond to IE cognates, it has retained many proto-IE features: for example, the 
demonstrative pronoun * ki- is ancestral to Albanian ky/kjo, English he, and Russian sej but not to English this or Russian etot. 

Albanian is compared to other Indo-European languages below, but note that Albanian has exhibited some notable instances of semantic drift 
(such as moter meaning "sister" rather than "mother". 


Vocabulary of Albanian and other Indo-European languages 


Albanian 

muaj 

ri 

nene 

moter 

nate 

hunde 

tr el 
tri 

zi 

kuq 

verdhe 

kalter 

ujk 

Proto- 

Indo- 

European 

*mehins- 

*neu- 

(i)o- 

*meh 2 ter 

*swesor 

*nok"ts 

*neh 2 - 

s- 

*treies 

*k“rsnos 

*mel-n- 

*hireud h - 
6- ~ 

hiroud h - 

6- 

*g h elh 3 - 

*b h lehi- 

uo- 

*wlk“os 

English 

month 

new 

mother 

sister 

night 

nose 

three 

black 

red 

yellow 

blue 

wolf 

Latin 

mensis 

novus 

mater 

soror 

noct- 

nasus 

tres 

ater, 

niger 

ruber 

helvus 

flavus 

lupus 

Lithuanian 

menuo / 
menesis 

naujas 

mote / 
motina 

sesuo 

naktis 

nosis 

trys 

juodas 

raudas / 
raudonas 

geltas / 
geltonas 

melynas 

vilkas 

Old 

Church 

Slavonic 

MkcALJb 

mes^cb 

HOBb 

novb 

MaTH 

mati 

cecTpa 

sestra 

HOLUTb 

nostb 

HOCb 

nosb 

TpH, 

TpHl£ 
tri, trije 

qpbHb 

crbnb 

qpbBeHb 

crbvenb 

>KJIbTb 

Zlbtb 

CHHb 

sinb 

BJIbKb 

vlbkb 

Ancient 

Greek 

II 

veoc; 

neos 

MrjTfip 

meter 

aSsAcpn 

adelphe 

VUKI- 

nukt- 

piv- 

rhin- 

xpeic; 

treTs 

jJEXac; 

melas 

spuGpoc; 

eruthros 

^avGoc; 

xanthos 

Kuavoc; 

kyanos 

Aukoc; 

lykos 

Armenian 

LUlIjlU 

amis 

bnp 

nor 

tfiujp 

mayr 

pnLjp 

k'uyr 

9h2tip 

giser 

ehp 

k'it 

bpbp 

yerek' 

ubL 

sev 

l|LUpiJ|ip 

karmir 

p.br\[ib 

degin 

MiuiqnLjin 

kapuyt 

qur)[ 

gayl 

Irish 

mi 

nua 

mdthair 

deirfiur 

oiche 

sron 

tri 

dubh 

dearg 

bui 

gorm 

faolchu 

Sanskrit 

*TO 

masa 

^cr 

nava 

Hlc£ 

matr 

icRJ 

svasr 

nrft/ftST 

nakta/nis 

TO 

nasa 

ft 

tri 

cbld/<£UJ| 

kala/krsna 

sfsR 

rudhira 

d)d/l?<ut| 

pTta/hiranya 

nlla 

vrka 


Albanian-PIE phonological correspondences 

Phonologically, Albanian is not so conservative. Like many IE stocks, it has merged the two series of voiced stops (e.g. both *d and *d h 
became d). In addition, voiced stops tend to disappear in between vowels. There is almost complete loss of final syllables and very 
widespread loss of other unstressed syllables (e.g. mik 'friend' from Lat. amicus). PIE *o appears as a (also as e if a high front vowel i 
follows), while *e and *d become o, and PIE *o appears as e. 

















































The palatals, velars, and labiovelars show distinct developments, with Albanian showing the three-way distinction also found in 
LuwianJ 65 ^ 66 ^ Labiovelars are for the most part differentiated from all other Indo-European velar series before front vowels, but they merge 
with the "pure" (back) velars elsewhereThe palatal velar series, consisting of Proto-Indo-European *k and the merged *g and g h , 
usually developed into th and dh, but were depalatalized to merge with the back velars when in contact with sonorantsj 65 ^ Because the 
original Proto-Indo-European tripartite distinction between dorsals is preserved in such reflexes, Albanian is therefore neither centum nor 
satem, despite having a "satem-like" realization of the palatal dorsals in most casesj 66 ^ Thus PIE *k, *k, and *k" become th, q, and s, 
respectively (before back vowels * k becomes th, while *k and *k " merge as k). 

A minority of scholars reconstruct a fourth laryngeal *h 4 allegedly surfacing as Alb. h word-initially, e.g. Alb. herdhe 'testicles' presumably 
from PIE *h 4 6rg h i -{ 67 ! (rather than the usual reconstruction *h 3 erg h i-), but this is generally not followed elsewhere, as h- as arisen 
elsewhere idiosyncratically (for example hark < Latin arcus) .! 68 1[69] 


Reflexes of PIE bilabial plosives in Albanian 


PIE 

Albanian 

PIE 

Albanian 

*P 

P 

*pek“- 'to cook' 

pjek 'to bake' 

*b h lb 

b 

*srob h -eie- 'to sip, gulp' 

gjerb 'to sip' 


Reflexes of PIE coronal plosives in Albanian 


PIE 

Albanian 

PIE 

Albanian 

*t 

t 

*tuh 2 'thou' 

ti 'you (singular)' 

*d 

d 

*dih 2 tis 'light' 

dite 'day' 


dh!* i] 

*perd- 'to fart' 

pjerdh 'to fart' 


g 

*dlh!-t6- 'long 1 

gjate 'long' (Tosk dial, glate) 

*d h 

d 

*d h eg“ h - 'burn' 

djeg 'to burn' 


dh!* i] 

*g h ord h os 'enclosure' 

gardh 'fence' 


1. Between vowels or after r 


Reflexes of PIE palatal plosives in Albanian 


PIE 

Albanian 

PIE 

Albanian 

*k 

th 

*keh 1 smi ’1 say 1 

them '1 say' 


si*!] 

*kupo- 'shoulder 1 

sup 'shoulder' 


kt* 2 l 

*smek-r 'chin' 

mjeker 'chin; beard' 


q/c!* 3 1 

*kentro- 'to stick' 

gander 'prop' 

*9 

dh 

*gdmb h os 'tooth, peg 1 

dhemb 'tooth' 

*g h J 

dh 

*g h ed-ioH ’1 defecate' 

dhjes '1 defecate' 

_ 

d [* 4 ] 

*g h rsd h i 'grain, barley' 

drithe 'grain 1 


1. Before u/u or i/i 

2. Before sonorant 

3. Archaic relic 

4. Syllable-initial and followed by sibilant 


Reflexes of PIE velar plosives in Albanian 


1 1 

PIE 

Albanian 

PIE 

Albanian 

*k 

k 

*kag h mi '1 catch, grasp' 

kam '1 have' 


q 

*kluH-io- 'to weep' 

qaj 'to weep, cry 1 (dial, kla(n)j) 

*g 

g 

*h 3 Hgos 'sick' 

lige 'bad' 


gj 

*h!reug- 'to retch' 

regj 'to tan hides' 

*gh 

g 

*g h ord h os 'enclosure' 

gardh 'fence' 


gj 

*g h edn-ie/o- 'to get 1 

gjej 'to find' (Old Alb. gjanj) 







Reflexes of PIE labiovelar plosives in Albanian 


PIE 

Albanian 

PIE 

Albanian 

*k" 

k 

*k "eh 2 sleh 2 'cough’ 

kolle 'cough' 


s 

*k"elH- 'to turn 1 

sjell 'to fetch, bring' 


q 

*k"od 

qe 'that, which' 

*g" 

g 

*g "rH 'stone' 

gur 'stone' 


z 

*g "reh 2 us 'heavy 1 

zor 'hard, difficult' 

*gwh 

g 

*d h eg " h - 'to burn' 

djeg 'to burn 1 


z 

*d h og " h eie - 'to ignite' 

ndez ’to kindle, light a fire’ 


Reflexes of PIE *s in Albanian 


PIE 

Albanian 

PIE 

Albanian 

*s 

gj [ * 11 

*sekstis 'six' 

gjashte 'six' 


hi* 2 ] 

*nosom 'us' (gen.) 

nahe 'us' (dat.) 


shl* 3 1 

*b h reusos 'broken' 

bresher 'hail 1 


th [*4] 

*sub 2 s 'swine' 

thi 'pig' 


0 

hiesmi ’1 am' 

jam '1 am' 

*-sd- 

th 

*g“esdos 'leaf 

gjeth 'leaf 

*-sk- 

h 

*sh-eh 2 'shadow' 

hije 'shadow' 

*-sp- 

f 

*spelnom 'speech' 

fjale 'word' 

*-st- 

sht 

*h 2 osti 'bone' 

asht 'bone' 

*-sy- 

d 

*sgeid-r- 'sweat' 

dirse 'sweat' 


1. Initial 

2. Between vowels 

3. Between u/i and another vowel (ruki law) 

4. Dissimilation with following s 


Reflexes of PIE sonorants in Albanian 


PIE 

Albanian 

PIE 

Albanian 

*i 

gj [ * 11 

*ieh 3 s- 'to gird' 

(n)gjesh '1 gird; squeeze, knead' 


j [<21 

*iuH 'you' (nom.) 

ju 'you (plural)' 


0[*3] 

*treies ’three' (masc.) 

tre 'three' 

*y 

V 

*gos-eie- 'to dress' 

vesh 'to wear, dress' 

*m 

m 

*meh 2 tr-eh 2 'maternal' 

moter 'sister' 

*n 

n 

*nos 'we' (acc.) 

ne 'we' 


nj 

*eni-hidi-no 'that one' 

nje 'one' (Gheg nya, ny'o, nyV) 


0 (Tosk) ~ nasal vowel (Gheg) 

*penk“e 'five' 

pese 'five' (vs. Gheg pes) 


r (Tosk only) 

*g h eimen 'winter' 

dimer 'winter' (vs. Gheg dimen) 

*1 

1 

*h 3 llgos 'sick' 

lige 'bad' 


II 

*k w elH- 'turn' 

sjell 'to fetch, bring' 

*r 

r 

*repe/o 'take' 

rjep 'peel' 


rr 

*urh 1 Sn 'sheep' 

rrunje 'yearling lamb' 

*n 

e 

*h 1 nmen 'name' 

emer 'name' 

*m 

e 

*uikipti 'twenty' 

(njejzet 'twenty' 

*1 

li, i|I* 4 l/lu, ul 

*u)k"os 'wolf 

uy/c'wolf (Chamian ulk ) 

*r 

ri, irl* 4 I / ru, ur 

*g h fsdom 'grain, barley' 

drithe 'grain' 


1. Before i, e, a 






2. Before back vowels 

3. Between vowels 

4. Before C clusters, i, j 

Reflexes of PIE laryngeals in Albanian 


PIE 

Albanian 

PIE 

Albanian 

_IT_ 

*hij 

0 

*hiesmi '1 am' 

jam 'to be' 

*h 2 

0 

*h 2 itkos 'bear' 

ari 'bear' 

*h 3 

0 

*h 3 onr' dream' 

enderr 'dream' 


Reflexes of PIE vowels in Albanian 


PIE 

Albanian 

PIE 

Albanian 

r*i 

i 

*slnos 'bosom' 

gji 'bosom, breast' 


e 

*dwig h eh 2 'twig' 

dege 'branch 1 

*1 < *iH 

i 

*dih 2 tis 'light' 

dite 'day' 

*e 

e 

*penk“e 'five' 

pese 'five 1 (Gheg pes) 


je 

*w/efos 'year' (loc.) 

vjet 'last year' 

*e 

0 

*g h esreh 2 'hand' 

dore 'hand' 

*a 

a 

*b h akeh 2 'bean' 

bathe 'bean' 


e 

*h 2 elb h it 'barley' 

elb 'barley' 

*0 

a 

*g h ord h os 'enclosure' 

gardh 'fence 1 

*0 

e 

*h 2 oktotis 'eight' 

fete 'eight' 

*u 

u 

*supnom 'sleep' 

gjume 'sleep' 

*u < *uhll 

y 

*suHsos 'grandfather' 

gjysh 'grandfather' 


i 

*muh 2 s 'mouse' 

mi 'mouse' 


Standard Albanian 


Since World War II, standard Albanian used in Albania has been based on the Tosk dialect. Kosovo and other areas where Albanian is 
official adopted the Tosk standard in 1969. ^ 


Elbasan-based standard 

Until the early 20th century, Albanian writing developed in three main literary traditions: Ghe g, Tosk, and Arbereshe. Throughout this time, 
an intermediate subdialect spoken around Elbasan served as lingua franca among the Albanians, but was less prevalent in writing. The 
Congress of Manastir of Albanian writers held in 1908 recommended the use of the Elbasan subdialect for literary purposes and as a basis of 
a unified national language. While technically classified as a southern Gheg variety, the Elbasan speech is closer to Tosk in phonology and 
practically a hybrid between other Gheg subdialects and literary Tosk.'- 70 -' 

Between 1916 and 1918, the Albanian Literary Commission met in Shkoder under the leadership of Luigj Gurakuqi with the purpose of 
establishing a unified orthography for the language. The commission, made up of representatives from the north and south of Albania, 
reaffirmed the Elbasan subdialect as the basis of a national tongue. The rules published in 1917 defined spelling for the Elbasan variety for 
official purposes. The Commission did not, however, discourage publications in one of the dialects, but rather laid a foundation for Gheg and 
Tosk to gradually converge into one J 70 ^ 

When the Congress of Lushnje met in the aftermath of World War I to form a new Albanian government, the 1917 decisions of the Literary 
Commission were upheld. The Elbasan subdialect remained in use for administrative purposes and many new writers embraced for creative 
writing. Gheg and Tosk continued to develop freely and interaction between the two dialects increased. 


Tosk standard 

At the end of World War II, however, the new communist regime radically imposed the use of the Tosk dialect in all facets of life in Albania: 
administration, education, and literature. Most Communist leaders were Tosks from the south.' 70 ' Standardization was directed by the 
Albanian Institute of Linguistics and Literature of the Academy of Sciences of Albania.' 71 ' Two dictionaries were published in 1954: an 
Albanian language dictionary and a Russian-Albanian dictionary. New orthography rules were eventually published in 1967' 71 ' and 1973 
Drejtshkrimi i gjuhes shqipe (Orthography of the Albanian Language).' 72 ' 






Until 1968, Kosovo and other Albanian-speaking areas in the former Yugoslavia followed the 1917 standard based on the Elbasan dialect, 
though it was gradually infused with Gheg elements in an effort to develop a Kosovan language separate from communist Albania's Tosk- 
based standard.] 73 ] Albanian intellectuals in the former Yugoslavia consolidated the 1917 twice in the 1950s, culminating with a thorough 
codification of orthographic rules in 1964J 74 ^ The mles already provided for a balanced variety that accounted for both Gheg and Tosk 
dialects, but only lasted through 1968. Viewing divergences with Albania as a threat to their identity, Kosovars arbitrarily adopted the Tosk 
project that Tirana had published the year before. Although it was never intended to serve outside of Albania, the project became the "unified 
literaiy language" in 1972, when approved by a rubberstamp Ort hography Congress J 70 ^ Only about 1 in 9 participants were from Kosovo. 
The Congress, held at Tirana, authorized the orthography rules that came out the following year, in 1973. 

More recent dictionaries from the Albanian government are Fjalori Drejtshkrimor i Gjuhes Shqipe (1976) (Orthographic Dictionary of the 
Albanian Language)] 75 ] and Dictionary of Today's Albanian language (Fjalori Gjuhes se Sotme Shqipe) (1980). 1 71 H 76 J Prior to World War II, 
dictionaries consulted by developers of the standard have included Lexikon tis Alvanikis glossis (Albanian: Fjalori i Gjuhes Shqipe 
(Kostandin Kristoforidhi, 1904),^ Fjalori i Bashkimit (1908),^ and Fjalori i Gazullit (1941)J 32 ^ 


Calls for reform 

Since the fall of the communist regime, Albanian orthography has stirred heated debate among scholars, writers, and public opinion in 
Albania and Kosovo, with hardliners opposed to any changes in the orthography, moderates supporting varying degrees of reform, and 
radicals calling for a return to the Elbasan dialect. Criticism of Standard Albanian has centred on the exclusion of the 'me+' infinitive and the 
Gheg lexicon. Critics say that Standard Albanian disenfranchises and stigmatizes Gheg speakers, affecting the quality of writing and 
impairing effective public communication. Supporters of the Tosk standard view the 1972 Congress as a milestone achievement in Albanian 
history and dismiss calls for reform as efforts to "divide the nation" or "create two languages." Moderates, who are especially prevalent in 
Kosovo, generally stress the need for a unified Albanian language, but believe that the 'me+' infinitive and Gheg words should be included. 
Proponents of the Elbasan dialect have been vocal, but have gathered little support in the public opinion. In general, those involved in the 
language debate come from diverse backgrounds and there is no significant correlation between one's political views, geographic origin, and 
position on Standard Albanian. 

Many writers continue to write in the Elbasan dialect but other Gheg variants have found much more limited use in literature. Most 
publications adhere to a strict policy of not accepting submissions that are not written in Tosk. Some print media even translate direct speech, 
replacing the 'me+' infinitive with other verb forms and making other changes in grammar and word choice. Even authors who have 
published in the Elbasan dialect will frequently write in the Tosk standard. 

In 2013, a group of academics for Albania and Kosovo proposed minor changes to the orthography.] 7 ®] Hardline academics boycotted the 
initiative,] 79 ] while other reformers have viewed it as well-intentioned but flawed and superficial.] 78 ] Media such as Rrokum and Java have 
offered content that is almost exclusively in the Elbasan dialect. Meanwhile, author and linguist Agim Morina has promoted Shqipe e 
Perbashket or Common Albanian, a neostandard or a reformed version of the Tosk standard that aims at reflecting the natural development 
of the language among all Albanians.]®®]] 70 ] Common Albanian incorporates the 'me+' infinitive, accommodates for Gheg features, provides 
for dialect-neutral rules that favor simplicity, predictability, and usage trends. 1 81 ]1 82 J Many modern writers have embraced Common 
Albanian to various extents, especially in less formal writing.]® 3 ] 


Education 

Albanian is the medium of instruction in most Albanian schools. The literacy rate in Albania for the total population, age 9 or older, is about 
99%. Elementary education is compulsory (grades 1-9), but most students continue at least until a secondary education. Students must pass 
graduation exams at the end of the 9th grade and at the end of the 12th grade in order to continue their education. 

Phonology 


Standard Albanian has 7 vowels and 29 consonants. Like English, Albanian has dental fricatives /0/ (like the th in thin) and /3/ (like the th in 
this), written as th and dh, which are rare cross-linguistically. 

Gheg uses long and nasal vowels, which are absent in Tosk, and the mid-central vowel e is lost at the end of the word. The stress is fixed 
mainly on the last syllable. Gheg n (femen: compare English feminine) changes to r by rhotacism in Tosk ( femer ). 


Consonants 






Labial 

Dental 

Alveolar 

Post- 

Palatal 

Velar 

Glottal 



plain 

velar. 

alveolar 

Nasal 

m 


n 



P 

(n) 


Plosive 

voiceless 

P 


t 




k 


voiced 

b 


d 




g 


Affricate 

voiceless 



ts 


t~J 

c? 



voiced 



dz 


d3 

n 



Fricative 

voiceless 

f 

0 

s 


I 



h 

voiced 

v 

a 

z 


3 




Approximant 



1 

1 


i 



Flap 



r 






Trill 



r 







IPA 

Description 

Written as 

English approximation 

m 

Bilabial nasal 

m 

man 

n 

Alveolar nasal 

n 

not 

p 

Palatal nasal 

nj 

-onion 

n 

Velar nasal 

ng 

bang 

p 

Voiceless bilabial plosive 

P 

spin 

b 

Voiced bilabial plosive 

b 

bat 

t 

Voiceless alveolar plosive 

t 

stand 

d 

Voiced alveolar plosive 

d 

debt 

k 

Voiceless velar plosive 

k 

scar 

g 

Voiced velar plosive 

g 

go 

ts 

Voiceless alveolar affricate 

C 

hats 

dz 

Voiced alveolar affricate 

X 

goods 

tjr 

Voiceless postalveolar affricate 

p 

chin 

<?3 

Voiced postalveolar affricate 

xh 

jet 

c? 

Voiceless palatal affricate 

q 

-china (RP) 

n 

Voiced palatal affricate 

gj 

-gem (RP) 

t 

Voiceless labiodental fricative 

f 

far 

V 

Voiced labiodental fricative 

V 

van 

e 

Voiceless dental fricative 

th 

thin 

a 

Voiced dental fricative 

dh 

then 

S 

Voiceless alveolar fricative 

s 

son 

z 

Voiced alveolar fricative 

z 

zip 

J 

Voiceless postalveolar fricative 

sh 

show 

3 

Voiced postalveolar fricative 

zh 

vision 

h 

Voiceless glottal fricative 

h 

hat 

r 

Alveolar trill 

rr 

Spanish perro 

r 

Alveolar tap 

r 

Spanish pero 

i 

Alveolar lateral approximant 

i 

lean 

i 

Velarized alveolar lateral approximant 

ii 

ball 

j 

Palatal approximant 

j 

yes 


Notes: 

■ The contrast between flapped r and trilled rr is the same as in Spanish or Armenian. In most of the dialects, as also in 
standard Albanian, the single "r" changes from an alveolar flap /r/ into a retroflex flap [j], or even an alveolar approximant 



[j] when it is at the end of a word. 

■ The palatal nasal /p/ corresponds to the Spanish n and the French and Italian gn. It is pronounced as one sound, not a 
nasal plus a glide. 

■ The II sound is a velarised lateral, close to English dark L. 

■ The letter g is sometimes written ch due to technical limitations because of its use in English sound and its analogy to the 
other digraphs xh, sh, and zh. Usually it is written simply c or more rarely q with context resolving any ambiguities. 

■ The position of q and gj sound is not clear. Many speakers merge them into the palatoalveolar sounds g and xh. This is 
especially common in Northern Gheg, but is increasingly the case in Tosk as well.! 84 ] Other speakers reduced them into /j/ in 
consonant clusters, such as in the word fjolle, which before standardization was written as fqolle ( < Medieval Greek 
(paKioApc;)- 

■ The ng can pronounced as Iql in final position, otherwise is an allophone of n before k and g. 

■ Before q and gj, the n is always pronounced Ipl but this is not reflected in the orthography. 

■ /0, 3, 1/ are interdental. 


Vowels 


IPA 

Description 

Written as 

English approximation 

i 

Close front unrounded vowel 

i 

seed 

£ 

Open-mid front unrounded vowel 

e 

bed 

a 

Open central unrounded vowel 

a 

cow 

0 

Schwa 

e 

about, the 

D 

Open-mid back rounded vowel 

0 

law 

y 

Close front rounded vowel 

y 

French tu, German Luge 

u 

Close back rounded vowel 

u 

boot 


Schwa 

Although the Indo-European schwa (9 or -h 2 -) was preserved in Albanian, in some cases it was lost, possibly when a stressed syllable 
preceded it.'- 85 -' Until the standardization of the modern Albanian alphabet, in which the schwa is spelled as e, as in the work of Gjon Buzuku 
in the 16th century, various vowels and gliding vowels were employed, including ae by Leke Matrenga and e by Pjeter Bogdani in the late 
16th and early 17th century. ^ 86 ^ 87 ^ The schwa in Albanian has a great degree of variability from extreme back to extreme front 
articulation.t 8 ®] Within the borders of Albania, the phoneme is pronounced about the same in both the Tosk and the Gheg dialect due to the 
influence of standard Albanian. However, in the Gheg dialects spoken in the neighbouring Albanian-speaking areas of Kosovo and North 
Macedonia, the phoneme is still pronounced as back and roundedJ 88 ^ 

Grammar 


Albanian has a canonical word order of SVO (subject-verb-object) like English and many other Indo-European languages.t 89 ^ Albanian 
nouns are categorized by gender (masculine, feminine and neuter) and inflected for number (singular and plural) and case. There are five 
declensions and six cases (nominative, accusative, genitive, dative, ablative, and vocative), although the vocative only occurs with a limited 
number of words, and the forms of the genitive and dative are identical (a genitive is produced when the prepositions i/e/te/se are used with 
the dative). Some dialects also retain a locative case, which is not present in standard Albanian. The cases apply to both definite and 
indefinite nouns, and there are numerous cases of s yncretism . 

The following shows the declension of mal (mountain), a masculine noun which takes "i" in the definite singular: 



Indefinite singular 

Indefinite plural 

Definite singular 

Definite plural 

Nominative 

nje mal (a mountain) 

male (mountains) 

mall (the mountain) 

malet (the mountains) 

Accusative 

nje mal 

male 

malin 

malet 

Genitive 

i/e/te/se nje mall 

i/e/te/se maleve 

i/e/te/se malit 

i/e/te/se maleve 

Dative 

nje mali 

mal eve 

malit 

maleve 

Ablative 

(prej) nje mall 

(prej) malesh 

(prej) malit 

(prej) maleve 


The following shows the declension of the masculine noun zog (bird), a masculine noun which takes "u" in the definite singular: 







Indefinite singular 

Indefinite plural 

Definite singular 

Definite plural 

Nominative 

nje zog (a bird) 

zogj (birds) 

zogu (the bird) 

zogjte (the birds) 

Accusative 

nje zog 

zogj 

zogun 

zogjte 

Genitive 

i/e/te/se nje zogu 

i/e/te/se zogjve 

i/e/te/se zogut 

i/e/te/se zogjve 

Dative 

nje zogu 

zogjve 

zogut 

zogjve 

Ablative 

(prej) nje zogu 

(prej) zogjsh 

(prej) zogut 

(prej) zogjve 


The following table shows the declension of the feminine noun vajze (girl): 



Indefinite singular 

Indefinite plural 

Definite singular 

Definite plural 

Nominative 

nje vajze (a girl) 

vajza (girls) 

vajza (the girl) 

vajzat (the girls) 

Accusative 

nje vajze 

vajza 

vajzen 

vajzat 

Genitive 

i/e/te/se nje vajze 

i/e/te/se vajzave 

i/e/te/se vajzes 

i/e/te/se vajzave 

Dative 

nje vajze 

vajzave 

vajzes 

vajzave 

Ablative 

(prej) nje vajze 

(prej) vajzash 

(prej) vajzes 

(prej) vajzave 


The definite article is placed after the noun as in many other Balka n langua ges, like in Romanian, Macedonian and Bulgarian. 

■ The definite article can be in the form of noun suffixes, which vary with gender and case. 

■ For example, in singular nominative, masculine nouns add -/, or those ending in -g/-k/-h take -u (to avoid palatalization): 

■ mal (mountain) / mali (the mountain); 

■ liber (book) / libri (the book); 

■ zog (bird) / zogu (the bird). 

■ Feminine nouns take the suffix -(i/j)a: 

■ veture (car) / vetura (the car); 

■ shtepi (house) / shtepia (the house); 

■ lule (flower) / lulja (the flower). 

■ Neuter nouns take -t. 

Albanian has developed an analytical verbal structure in place of the earlier synthetic system, inherited from Proto-Indo-European. Its 
complex system of moods (six types) and tenses (three simple and five complex constructions) is distinctive among Bal kan language s. There 
are two general types of conjugations. 

Albanian verbs, like those of other Balkan languages, have an "admirative" mood ( menyra habitore ) that is used to indicate surprise on the 
part of the speaker or to imply that an event is known to the speaker by report and not by direct observation. In some contexts, this mood can 
be translated using English "apparently". 

■ Ti flet shqip. "You speak Albanian." (indicative) 

■ Ti fliske shqip! "You (surprisingly) speak Albanian!" (admirative) 

■ Rruga eshte e mbyllur. "The street is closed." (indicative) 

■ Rruga qenka e mbyllur. "(Apparently,) The street is closed." (admirative) 

For more information on verb conjugation and on inflection of other parts of speech, see Albanian morphology. 

Word order 

Albanian word order is relatively free. To say 'Agim ate all the oranges' in Albanian, one may use any of the following orders, with slight 
pragmatic differences: 

■ SVO: Agimi i hengri te gjithe portokallet. 

■ SOV: Agimi te gjithe portokallet i hengri. 

■ OVS: Te gjithe portokallet i hengri Agimi. 

■ OSV: Te gjithe portokallet Agimi i hengri. 

■ VSO: I hengri Agimi te gjithe portokallet. 

However, the most common order is subject-verb-object. 

The verb can optionally occur in sentence-initial position, especially with verbs in the non-active form ( forma joveprore ): 

■ Parashikohet nje nderprerje "An interruption is anticipated". 





Negation 

Verbal negation in Albanian is mood-dependent, a trait shared with some fellow Indo-European languages such as Greek. 

In indicative, conditional, or admirative sentences, negation is expressed by the particles nuk or s' in front of the verb, for example: 

■ Toni nuk flet anglisht "Tony does not speak English"; 

■ Toni s'flet anglisht "Tony doesn't speak English"; 

■ Nuk e di "I do not know"; 

■ S'e di "I don't know". 

Subjunctive, imperative, optative, or non-finite forms of verbs are negated with the particle mos: 

■ Mos harm "do not forget!". 

Numerals 


nje—one 

tetembedhjete—eighteen 

dy—two 

nentembedhjete—nineteen 

tri/tre—three 

njezet—twenty 

kater—four 

njezetenje—twenty-one 

pese—five 

njezetedy—twenty-two 

gjashte—six 

tridhjete—thirty 

shtate—seven 

dyzet/kate rd hj ete—fo rty 

fete—eight 

pesedhjete—fifty 

nente—nine 

gjashtedhjete—sixty 

dhjete—ten 

shtatedhjete—seventy 

njembedhjete—eleven 

tetedhjete—eighty 

dymbedhjete—twelve 

nentedhjete—ninety 

trembedhjete—thirteen 

njeqind—one hundred 

katermbedhj ete—f o u rte e n 

peseqind—five hundred 

pesembedhjete—fifteen 

njemije—one thousand 

gjashtembedhjete—sixteen 

nje milion—one million 

shtatembedhjete—seventeen 

nje miliard—one billion 


Literary tradition 


Earliest undisputed texts 

The earliest known texts in Albanian: 

■ the "formula e pagezimit" (Baptismal Formula), which dates back to 1462 and was 
authored by Pal Engjelli (or Paulus Angelus) (c. 1417 - 1470), Archbishop of Durres. 
Engjelli was a close friend and counsellor of Skanderbeg.! 90 ! It was written in a pastoral 
letter for a synod at the Holy Trinity in Mat and read in Latin characters as follows: Unte 
paghesont premenit Atit et Birit et Spertit Senit (standard Albanian: "Une te pagezoj ne 
emer te Atit, te Birit e te Shpirtit te Shenjte"; English: "I baptize you in the name of the 
Father and the Son and the Holy Spirit"). It was discovered and published in 1915 by 
Nicolae lorga.! 91 ! 

■ the Fjalori i Arnold von Harfit (Arnold Ritter von Harff's lexicon), a short list of Albanian 
phrases with German glosses, dated 1496.! 92 ! 

■ a song, recorded in the Greek alphabet, retrieved from an old codex that was written in 
Greek. The document is also called "Perikopeja e Ungjillit te Pashkeve" or "Perikopeja e 
Ungjillit te Shen Mateut" ("The Song of the Easter Gospel, or "The Song of Saint 
Matthew's Gospel"). Although the codex is dated to during the 14th century, the song, 
written in Albanian by an anonymous writer, seems to be a 16th-century writing. The 
document was found by Arbereshe people who had emigrated to Italy in the 15th 
centuryJ 93 ! 
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Meshari of Gjon Buzuku 1554-1555 





■ The first book in Albanian is the Meshari ("The Missal"), written by Gjon Buzuku between 20 March 1554 and 5 January 
1555. The book was written in the Gheg dialect in the Latin script with some Slavic letters adapted for Albanian vowels. The 
book was discovered in 1740 by Gjon Nikolle Kazazi, the Albanian archbishop of Skopje . It contains the liturgies of the main 
holidays. There are also texts of prayers and rituals and catechetical texts. The grammar and the vocabulary are more 
archaic than those in the Gheg texts from the 17th century. The 188 pages of the book comprise about 154,000 words with a 
total vocabulary of c. 1,500 different words. The text is archaic yet easily interpreted because it is mainly a translation of 
known texts, in particular portions of the Bible. The book also contains passages from the Psalms, the Book of Isaiah, the 
Book of Jeremiah, the Letters to the Corinthians, and many illustrations. The uniformity of spelling seems to indicate an 
earlier tradition of writing. The only known copy of the Meshari is held by the Apostolic Library.^ 94 ] In 1968 the book was 
published with transliterations and comments by linguists. 

Albanian scripts were produced earlier than the first attested document, "formula e pagezimit", but none yet have been discovered. We know 
of their existence by earlier references. For example, a French monk signed as "Broccardus" notes, in 1332, that "Although the Albanians 
have another language totally different from Latin, they still use Latin letters in all their books' 'J 95 ^ 


Disputed earlier texts 


In 1967 two scholars claimed to have found a brief text in Albanian inserted into the Bellifortis text, 
a book written in Latin dating to 1402-1405.^ 


"A star has fallen in a place in the woods, distinguish the star, distinguish it. 

Distinguish the star from the others, they are ours, they are. 

Do you see where the great voice has resounded? Stand beside it 

That thunder. It did not fall. It did not fall for you, the one which would do it. 

Like the ears, you should not believe ... that the moon fell when ... 

Try to encompass that which spurts far ... 

Call the light when the moon falls and no longer exists ..." 


Dr. Robert Elsie, a specialist in Albanian studies, considers that "The Todericiu/Polena Romanian 
translation of the non-Latin lines, although it may offer some clues if the text is indeed Albanian, is 
fanciful and based, among other things, on a false reading of the manuscript, including the exclusion 
of a whole line."] 97 ] 
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Possibly the oldest surviving 
Albanian text, highlighted in red, 
from the Bellifortis manuscript, 
written by Konrad Kyeser around 
1402-1405. 


Ottoman period 

In 1635, Frang Bardhi (1606-1643) published in Rome his Dictionarum latinum-epiroticum, the first known Latin-Albanian dictionary. 
Other scholars who studied the language during the 17th century include Andrea Bogdani (1600-1685), author of the first Latin-Albanian 
grammar book, Nilo Katalanos (1637-1694) and others.] 98 ] 


Lexicon 


Albanian is known within historical linguistics as a case of a language which, although surviving through many periods of foreign rule and 
multilingualism, saw a "disproportionately high" influx of loans from other languages augmenting and replacing much of its original 
vocabulary.]"] Some scholars suggest that Albanian seems to have lost more than 90% of its original vocabulary in favour of Latin, Greek, 
Slavic, Italian and Turkish loanwords,] 10 °] but according to other scholars this percentage is definitely overstated.] 101 ] Of all the foreign 
influences in Albanian, the deepest reaching and most impactful was the absorption of loans from Latin in the Classical period and its 
Romance successors afterward, with over 60% of Albanian vocabulary consisting of Latin roots, causing Albanian to once have been 
mistakenly identified as a Romance language.] 102 ] 

Major work in reconstructing Proto-Albanian has been done with the help of knowledge of the original forms of loans from Ancient Greek, 
Latin and Slavic, while Ancient Greek loanwords are scarce the Latin loanwords are of extreme importance in phonology .] 103 ] The presence 
of loanwords from more well-studied languages from time periods before Albanian was attested, reaching deep back into the Classical Era, 
has been of great use in phonological reconstructions for earlier ancient and medieval forms of Albanian.]"] Some words in the core 
vocabulaiy of Albanian have no known etymology linking them to Proto-Indo-European or any known source language, and as of 2018 are 
thus tentatively attributed to an unknown, unattested, pre-Indo-European substrate language; some words among these include zemer (heart) 
and hekur (iron).] 104 ] Some among these putative pre-IE words are thought to be related to putative pre-IE substrate words in neighboring 
Indo-European languages, such as lule (flower), which has been tentatively linked to Latin lilia and Greek leirionJ 105 ^ 

Lexical distance of Albanian in a lexicostatistical analysis of the Ukrainian linguist Tyshchenko (the lower figure - the higher similarity): 
49% Slovenian, 53% Romanian, 56% Greek, 82% French, 86% Macedonian, 86% Bulgarian.] 106 ]] 107 ] 



















Cognates with Illyrian 

■ Andena/Andes/Andio/Antis — personal Illyrian names based on a root-word and- or ant-, found in both the southern and the 
Dalmatian-Pannonian (including modern Bosnia and Herzegovina) onomastic provinces; cf. Alb. ande (northern Albanian 
dialect, or Gheg) and ende (southern Albanian dialect or Tosk) "appetite, pleasure, desire, wish"; Andi proper name, 
Andizetes, an Illyrian people inhabiting the Roman province of Panonia.! 108 ! 

■ aran "field"; cf. Alb. are; plural ara! 109 ! 

■ Ardiaioi/Ardiaei, name of an Illyrian people, cf. Alb. ardhja "arrival" or "descent", connected to hardhi "vine-branch, grape¬ 
vine", with a sense development similar to Germanic *stamniz, meaning both stem, tree stalk and tribe, lineage. However, 
the insufficiency of this hypothesis is that so far there is no certainty as to the historical or etymological development of 
either ardhja/hardhi or Ardiaioi, as with many other words.I 108 ! 

■ Bilia "daughter"; cf. Alb. bije, dial, to/'/e! 110 ! 

■ Bindo/Bindus, an Illyrian deity from Bihac, Bosnia and Herzegovina; cf. Alb. bind "to convince" or "to make believe", 
perbindesh "monster".I 111 ] 

■ bounon, "hut, cottage"; cf. Alb bun I 112 ! 

■ brisa, "husk of grapes"; cf. Alb bersi "lees, dregs; mash" ( < PA *brutiaf 113 ^ 

■ Barba- "swamp", a toponym from Metubarbis ; possibly related to Alb. berrake "swampy soil"! 113 ] 

■ can- "dog"; related to Alb. gen! 113 ! 

■ Daesitiates, a name of an Illyrian people, cf. Alb. dash "ram", corresponding contextually with south Slavonic dasa "ace", 
which might represent a borrowing and adaptation from Illyrian (or some other ancient language).I 108 ] 

■ mal, "mountain"; cf. Alb mal I 114 ] 

■ bardi, "white"; cf. Alb bardhe I 115 ! 

■ drakoina "supper"; cf. Alb. darke, dreke ! 2 °! 

■ drenis, "deer"; cf. Alb dre, dren/! 112 ! 

■ delme "sheep"; cf. Alb dele, Gheg dialect de/me! 116 ! 

■ dard, "pear"; cf. Alb dardhe^ 117 1 

■ Hyllus (the name of an Illyrian king); cf. Alb. yll ( hyll in some northern dialects) "star", also Alb. hyj "god", Ylli proper 
name.! 20 ! 

■ slca, "dagger"; cf. Alb thike or thika "knife"! 118 ] 

■ Ulc-, "wolf" (pin. Ulcinium ); cf. Alb ujk "wolf", ulk (Northern Dialect)! 119 ! 

■ lougeon, "pool"; cf. Alb lag, legen "to wet, soak, bathe, wash" ( < PA *lauga), legate "pool" ( < PA *leugata ), lakshte "dew" ( < 
PA /aug/'sfa)! 120 ! 

■ mag- "great"; cf. Alb. i madh "big, great"! 113 ! 

■ mantfa "bramblebush"; Old and dial. Alb mande "berry, mulberry" (mod. Alb men, man ) 

■ rhinos, "fog, mist"; cf. Old Alb ren "cloud" (mod. Alb re, re) ( < PA *dna)! 121 ! 

■ Vendum "place"; cf. Proto-Alb. wen-ta (Mod. Alb. vend)! 20 ! 

Early linguistic influences 

The earliest loanwords attested in Albanian come from Doric Greek,! 122 ! w hereas the strongest influence came from Latin. According to 
Matthew C. Curtis, the loanwords do not necessarily indicate the geographical location of the ancestor of Albanian language.! 123 ! However, 
according to other linguists, the borrowed words can help to get an idea about the place of origin and the evolution of the Albanian 
language.! 124 !! 125 ! Some scholars argue that Albanian originated from an area located east of its present geographic spread due to the several 
common lexical items found between the Albanian and Romanian languages. However it does not necessarily define the genealogical histoiy 
of Albanian language, and it does not exclude the possibility of Proto-Albanian presence in both Illyria n and Thracian territory.! 126 ! 

The period during which Proto-Albanian and Latin interacted was protracted, lasting from the 2nd centuiy BC to the 5th centuiy AD.! 57 ! 
Over this period, the lexical borrowings can be roughly divided into three layers, the second of which is the largest. The first and smallest 
occurred at the time of less significant interaction. The final period, probably preceding the Slavic or Germanic invasions, also has a notably 
smaller number of borrowings. Each layer is characterized by a different treatment of most vowels: the first layer follows the evolution of 
Early Proto-Albanian into Albanian; while later layers reflect vowel changes endemic to Late Latin (and presumably Proto-Romance). Other 
formative changes include the syncretism of several noun case endings, especially in the plural, as well as a large-scale palatalization. 

A brief period followed, between the 7th and the 9th centuries, that was marked by heavy borrowings from Southern Slavic, some of which 
predate the "o-a" shift common to the modern forms of this language group. Starting in the latter 9th century, there was a period 
characterized by protracted contact with the Proto-Romanians, or Vlachs, though lexical borrowing seems to have been mostly one sided: 
from Albanian into Romanian. Such borrowing indicates that the Romanians migrated from an area where the majority was Slavic (i.e. 
Middle Bulgarian) to an area with a majority of Albanian speakers (i.e. Dardania, where Vlachs are recorded in the 10th century). Their 
movement is presumably related to the expansion of the Bulgarian Empire into Albania around that time. 


Early Greek loans 




There are some 30 Ancient Greek loanwords in Albanian.! 127 ! Many of these reflect a dialect which voiced its aspirants, as did the 
Macedonian dialect. Other loanwords are Doric; these words mainly refer to commodity items and trade goods and probably came through 
trade with a now-extinct intermediary.! 122 ! 

■ blete; "hive, bee" < Attic melitta "bee" (vs. Ionic melissa )} 128 1 

■ draper; "sickle" < (NW) drapanon ! 129 ! 

■ kumbull; "plum" < kokkumelon I 129 ! 

■ laker; "cabbage, green vegetables" < lachanon "green; vegetable"! 130 ! 

■ lepjete; "orach, dock" < lapathon^ 131 ^ 

■ leva (lyej); "to smear, oil" < *liwenj < *elaiwa < Gk elai(w)on "oil" 

■ moker; "millstone" < (NW) machana "device, instrument"! 127 ! 

■ molle; "apple" < melon "fruit"! 132 ! 

■ pjeper; "melon" < pepon 

■ presh; "leek” < prason ! 13 °! 

■ shpelle; "cave" < spelaion 

■ trumze; "thyme" < (NW) thymbra, thrymbre ! 129 ! 


Latin influence 

In total Latin roots comprise over 60% of the Albanian lexicon.! 102 ! They include many frequently used core vocabulary items, including 
shume ("very", from Latin summus), pak ("few", Latin paucus), ngushte ("narrow", Latin angustus), peme ("tree", Latin poma), vij ("to 
come", Latin venio), rere ("sand", Latin arena), drejt ("straight", Latin "directus"), kafshe ("beast", Latin causa, meaning "thing"), and larg 
("far away", Latin largus). 

Jernej Kopitar (1780-1844) was the first to note Latin's influence on Albanian and claimed "the Latin loanwords in the Albanian language 
had the pronunciation of the time of Emperor Augustus".! 133 ! Kopitar gave examples such as Albanian qiqer 'chickpea' from Latin cicer, 
qytet 'city, town' from civitas, peshk 'fish' from piscis, and shigjete 'arrow' from sagitta. The hard pronunciations of Latin (c) and (g) are 
retained as palatal and velar stops in the Albanian loanwords. Gustav Meyer (1888)! 134 ! and Wilhelm Meyer-Liibke (1914)! 135 ! later 
corroborated this. Meyer noted the similarity between the Albanian verbs shqipo] "to speak clearly, enunciate" and shqipto] "to pronounce, 
articulate" and the Latin word excipio (meaning "to welcome"). Therefore, he believed that the word Shqiptar "Albanian person" was 
derived from shqipo], which in turn was derived from the Latin word excipere. Johann Georg von Hahn, an Austrian linguist, had proposed 
the same hypothesis in 1854.! 136 ! 

Eqrem Qabej also noticed, among other things, the archaic Latin elements in Albanian:! 137 ! 

1. Latin /au/ becomes Albanian lal in the earliest loanwords: aurum - ar 'gold'; gaudium -> gaz 'joy'; laurus - lar laurel'. Latin 
/au/ is retained in later loans, but is altered in a way similar to Greek: causa 'thing' - kafshe 'thing; beast, brute'; laud -> 
lavd. 

2. Latin /o:/ becomes Albanian lei in the oldest Latin loans: pomus -> peme 'fruit tree'; bora - ora 'hour'. An analogous 
mutation occurred from Proto-Indo-European to Albanian; PIE *nds became Albanian ne 'we', PIE *oktd + suffix -ti- became 
Albanian fete 'eight', etc. 

3. Latin unstressed internal and initial syllables become lost in Albanian: cubitus - kub 'elbow'; medicus - mjek 'physician'; 
paludem 'swamp' - VL padule - pyll 'forest'. An analogous mutation occurred from Proto-Indo-European to Albanian. In 
contrast, in later Latin loanwords, the internal syllable is retained: paganus - pagan ; plaga -> plage 'wound', etc. 

4. Latin /tj/, /dj/, /kj/ palatalized to Albanian Isl, /z/, /c/: vitium - ves 'vice; worries'; rationem -> arsye 'reason'; radius - rreze 
'ray; spoke'; facies - faqe 'face, cheek'; socius - shok 'mate, comrade', shoq 'husband', etc. In turn, Latin Isl was altered to 
/J7 in Albanian. 

Haralambie Mihaescu demonstrated that: 

■ Some 85 Latin words have survived in Albanian but not (as inherited) in any Romance language. A few examples include 
Late Latin celsydri -* dial, kulsheder - kugeder 'hydra', hfbernus - verri 'winter pasture', sarcinarius 'used for packing, 
loading' - shelqeror 'forked peg, grapnel, forked hanger', so/anum 'nightshade', lit. 'sun plant’ - shulle(r) 'sunny place out 
of the wind, sunbathed area', spleneticus - shpretke 'spleen', trifurcus -> terfurk 'pitchfork'.! 138 ! 

■ 151 Albanian words of Latin origin were not inherited in Romanian. A few examples include Latin amicus - Albanian mik 
'friend', inimicus - armik 'foe, enemy', rationem -> arsye, benedicere -► bekoj, bubulcus 'ploughman, herdsman' -> bulk, 
bujk 'peasant', calicis - qelq 'drinking glass', castellum -> keshtjelle 'castle', centum -> qind 'hundred', gallus -► gjel 
'rooster', iunctura -> gjymtyre 'limb; joint', medicus - mjek 'doctor', retem -> rrjete 'net', sperare -* dial. shp(e)rej shpresoj 
'to hope' pres 'await', voluntas ( voluntatis ) - vullnet 'will; volunteer'.! 139 1 

■ Some Albanian church terminology has phonetic features which demonstrate their very early borrowing from Latin. A few 
examples include Albanian bekoj 'to bless' from benedicere, engjell 'angel' from angelus, kishe 'church' from ecclesia, i 
krishtere 'Christian' from christianus, kryq 'cross' from crux ( crucis ), (obsolete) Iter 'altar' from Latin altarium, mallkoj 'to 
curse' from maledicere, meshe 'mass' from missa, murg ’monk' from monacus, peshkep 'bishop' from episcopus, and ungjill 
'gospel' from evangelium J 140 ! 




Other authors! 141 ! have detected Latin loanwords in Albanian with an ancient sound pattern from the 1st centuiy BC, for example, Albanian 
qingel(e) 'saddle girth; dwarf elder' from Latin cingula and Albanian e vjeter 'old, aged; former' from vjet but influenced by Latin veteris. 
The Romance languages inherited these words from Vulgar Latin: cingula became Romanian chinga 'girdle; saddle girth', and Vulgar Latin 
veteranus became Romanian batran 'old'. 

Albanian, Bas que, and the surviving Celtic language s such as Breton and Welsh are the non-Romance languages today that have this sort of 
extensive Latin element dating from ancient Roman times, which has undergone the sound changes associated with the languages. Other 
languages in or near the former Roman area either came on the scene later (Turkish, the Slavic languages, Arabic) or borrowed little from 
Latin despite coexisting with it (Greek, German), although German does have a few such ancient Latin loanwords ( Fenster 'window', Kase 
'cheese', Kolri). 

Romanian scholars such as Vatasescu and Mihaescu, using lexical analysis of the Albanian language, have concluded that Albanian was 
heavily influenced by an extinct Romance language that was distinct from both Romanian and Dalmatian. Because the Latin words common 
to only Romanian and Albanian are significantly fewer in number than those that are common to only Albanian and Western Romance, 
Mihaescu argues that the Albanian language evolved in a region with much greater contact with Western Romance regions than with 
Romanian-speaking regions, and located this region in present-day Albania, Kosovo and Western Macedonia, spanning east to Bitola and 
Pristina.! 142 ! 

Gothic loans 

Some Gothic loanwords were borrowed through Late Latin, while others came from the Ostrogothic expansion into parts of Praevalitana 
around Niksic and the Gulf of Kotor in Montenegro. 

■ fat; "groom, husband" < Goth brujofafos "bridegroom"! 143 ! 

■ horr; "scoundrel", horre; "hussy, whore" < Goth hors "adulterer", *hora "whore" 

■ shkulke; "boundary marker for pastures made of branches" < Late Latin sculca < Goth skulka "guardian" 

■ shkume; "foam" < Late Latin < Goth skuma 

■ tirq; "trousers" < Late Latin tubrucus < Goth *ftiobrok "knee-britches”; cf. OHG dioh-bruoh, Eng thigh, breeches 

Other loans 

It is assumed that Greek and Balkan Latin (the ancestor of Romanian and other Balkan Romance languages) exerted a great influence on 
Albanian. Examples of words borrowed from Latin: qytet < civitas (city), qiell < caelum (sky), mik < amicus (friend), kape diten < carpe 
diem (seize the day). 

After the Slavs arrived in the Balkans, the Slavic languages became an additional source of loanwords. The rise of the Ottoman Empire 
meant an influx of Turkish words; this also entailed the borrowing of Persian and Arabic words through Turkish. Some Turkish personal 
names, such as Altin, are common. There are some loanwords from Modern Greek, especially in the south of Albania. Many borrowed words 
have been replaced by words with Albanian roots or modern Latinized (international) words. 

Patterns in loaning 

Although Albanian is characterized by the absorption of many loans, even, in the case of Latin, reaching deep into the core vocabulary, 
certain semantic fields nevertheless remained more resistant. Terms pertaining to social organization are often preserved, though not those 
pertaining to political organization, while those pertaining to trade are all loaned or innovated.! 144 ! 

Hydronyms present a complicated picture; the term for "sea" (det) is native and an "Albano-Germanic" innovation referring to the concept of 
depth, but a large amount of maritime vocabulary is loaned. Words referring to large streams and their banks tend to be loans, but lume 
("river") is native, as is rryme (the flow of water). Words for smaller streams and stagnant pools of water are more often native, but the word 
for "pond", pellg is in fact a semantically shifted descendant of the old Greek word for "high sea", suggesting a change in location after 
Greek contact. Albanian has maintained since Proto-Indo-European a specific term referring to a riverside forest ( gjaze ), as well as its words 
for marshes. Curiously, Albanian has maintained native terms for "whirlpool", "water pit" and (aquatic) "deep place", leading Orel to 
speculate that the Albanian Urheimat likely had an excess of dangerous whirlpools and depths.! 145 ! 

Regarding forests, words for most conifers and shrubs are native, as are the terms for "alder", "elm", "oak", "beech", and "linden", while 
"ash", "chestnut", "birch", "maple", "poplar", and "willow" are loans.! 146 ! 

The original kinship terminology of Indo-European was radically reshaped; changes included a shift from "mother" to "sister", and were so 
thorough that only three terms retained their original function, the words for "son-in-law", "mother-in-law" and "father-in-law". All the 
words for second-degree blood kinship, including "aunt", "uncle", "nephew", "niece", and terms for grandchildren, are ancient loans from 
Latin.! 147 ! 

The Proto-Albanians appear to have been cattle breeders given the vastness of preserved native vocabulary pertaining to cow breeding, 
milking and so forth, while words pertaining to dogs tend to be loaned. Many words concerning horses are preserved, but the word for horse 
itself is a Latin loan.! 148 ! 



See also 


■ Abetare 

■ Arberesh language 

■ Arvanitika 

■ Gheg Albanian 

■ Illyrian languages 

■ IPA/Albanian 

■ Messapian language 

■ Thraco-lllyrian 

■ Tosk Albanian 

Notes 


A Co-official language. 

a. A Kosovo is the subject of a territorial dispute between the Republic of Kosovo and the Republic of Serbia. The Republic 
of Kosovo unilaterally declared independence on 17 February 2008, but Serbia continues to claim it as part of its own 
sovereign territory. The two governments began to normalise relations in 2013, as part of the 2013 Brussels Agreement. 
Kosovo is currently recognized as an independent state by 97 out of the 193 United Nations member states. In total, 112 
UN member states recognized Kosovo at some point, of which 15 later withdrew their recognition. 
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